Villin gene promoter sequence and its use in vectors, transformed mammalian cell lines, transgenic
animals, and cell lines derived from the animals.



An isolated DNA sequence encoding a promoter of the human villin gene is provided. The DNA
sequence can be operatively linked to a nucleotide sequence of a gene, such as an oncogene, and 
incorporated in cloning and expression vectors. The vectors are useful in the production of transgenic 
non-human mammals as models of colorectal cancer. 

Technical Field

This invention relates to a gene promoter sequence for regulating gene expression in mammalian cells.
This invention also relates to vectors containing the promoter sequence, cells transformed with the vectors,
transgenic animals based on the vectors, and cell lines derived from cells in the animals.

Background Art 

The ability to introduce genes into the germ line of mammals is of great interest in biology. The propensity 
of mammalian cells to take up exogenously added DNA and to express genes included in the DNA has been
known for many years. The results of gene manipulation are inherited by the offspring of these animals. All 
cells of these offspring inherit the introduced gene as part of their genetic make-up. Such animals are said to 
be transgenic. 

Transgenic mammals have provided a means for studying gene regulation during embryogenesis and in 
differentiation, for studying the action of oncogenes, and for studying the intricate interaction of cells in the
immune system. The whole animal is the ultimate assay system for manipulated genes, which direct complex 
biological processes. 

Transgenic animals can provide a general assay for functionally dissecting DNA sequences responsible 
for tissue specific or developmental regulation of a variety of genes. In addition, transgenic animals provide 
useful vehicles for expressing recombinant proteins and for generating precise animal models of human genetic
disorders. 

By means of recombinant techniques, it is possible to make constructs containing oncogenes. Many
oncogene constructs have been introduced into transgenic mice and have been found to elicit tumorigenic
responses. For example, mice that carry the SV40 enhancer and region coding for the large T-antigen reproducibly
develop tumors of the choroid plexus, which are derived from the cells lining the ventricles of the brain. Palmiter
et al., Nature 316:457-60 (1985). In similar experiments, the c-myc oncogene was fused to the LTR of mouse
mammary tumor virus. In one transgenic line, females characteristically developed mammary carcinomas
during the second or third pregnancy. Stewart et al., Cell 38:627-37 (1984). In other experiments, the c-myc
oncogene was driven by immunoglobulin enhancers and gave rise to malignant lymphoid tumors in transgenic
mice. Adams et al., Nature 318:533-8 (1985). When an oncogenic human ras gene was fused to an elastase
promoter/enhancer construct, transgenic mice were born with pancreatic neoplasms that appeared to be due
to transformation of all of the differentiating pancreatic cells. Palmiter et al., Ann. Rev. Genet. 20:465-99 (1986).

For a general discussion of gene cloning and expression in animals and animal cells, see Old and Primrose,
"Principles of Gene Manipulation," Blackwell Scientific Publications, London (1989), page 255 et seq.

Transgenic lines, which have a predisposition to specific tumors, are of great value in the investigation of
the events leading to malignant transformation. It is well known that the efficacy of an anti-cancer treatment is
dependent on identification of the tumor cells that are the primary cause of the disease. This is especially true
in the detection of proliferation of tumor cells of the gastrointestinal tract or of renal origin. Indeed, cancers of
the digestive tract are widespread, and there is a critical need for effective treatment for these diseases. The
discovery of effective treatments can be expedited by providing an animal model that will lead to
carcinogenesis, which will enable the study of the efficacy, safety, and mode of action of anti-tumoral
pharmaceutical agents.

In summary, there exists a need in the art for reagents and methods for providing transgenic animal models 
of human tumor growth and proliferation, including transgenic animal models of human tumors of the digestive 
tract and renal system.

Disclosure of the Invention 

Accordingly, this invention aids in fulfilling these needs in the art. Specifically, this invention provides an
isolated DNA sequence encoding or corresponding to a promoter of the human villin gene. The nucleotide
sequence consists essentially of:

This invention also provides a DNA sequence comprising the promoter of the human villin gene operatively
linked to a nucleotide sequence of a structural gene or an oncogene. In specific embodiments of the invention,
the structural gene encodes villin or is a reporter gene, such as the gene coding for luciferase.

This invention further provides isolated RNA complementary to the promoter sequence of the human villin
gene and to the other DNA sequences described herein.

In another embodiment of the invention, a eukaryotic vector is provided. The vector comprises a plasmid
or viral vector containing the promoter sequence of the human villin gene.

In addition, this invention provides mammalian cells transformed with the vectors of the invention.

Also, this invention provides transgenic animals containing the promoter of the human villin gene and cell
lines cultured from cells of the transgenic animals.

Brief Description of the Drawings 

This invention will be more fully described with reference to the drawings in which: 
Fig. 1' is a representation of the structure of the isolated 5'-flanking region of the villin gene;
Fig. 2 depicts a partial structure of the human villin gene showing the approximate location of introns and
exons;
Fig. 3 depicts the nucleotide sequence of a portion of human villin cDNA, including the location of the
intervening sequences (IVS) in the first 900 base pairs (bp) at the 5'-end;
Fig. 4'A-B depicts the 5'-region of the nucleotide sequence of DNA encoding a promoter of the human villin
gene;
Fig. 4"A-N depicts the nucleotide sequence of the villin gene promoter, including the sequence of
complementary strands and restriction endonuclease sites;
Fig. 5 depicts plasmid vector pSVOA and the luciferase gene employed in one embodiment of the invention;
Fig. 6 is a diagrammatic representation of the procedure used to construct plasmid pPrVL of the invention;
and
Fig. 7 is a diagrammatic representation of the preparation of an animal model for colorectal cancers and
cell cultures derived from animal tumors.

Best Mode for Carrying Out the Invention 

1. Gene Promoter Sequence 

This invention provides an isolated DNA sequence encoding a promoter of the human villin gene. Villin is 
a protein with a molecular weight of about 95,000 daltons and is normally present in the villi of the digestive
mucosa, more especially in intestinal villi. Villin is able to bind to actin in the presence of calcium ions.

Villin is usually found in brush borders, which are observed at the apex of enterocytes, particularly after 
they have attained their final stage of differentiation. The enterocytes line the entire intestinal lumen. Micro-villi 
forming the brush border are assembled at the final stage of differentiation of the enterocyte. Villin is localized 
in bundles of axial microfilaments of micro-villi. It contributes to the structure of their cytoskeleton. On examining
frozen sections by immunofluorescence, it has been observed that villin is present at the apical pole of elongated
cells forming columns at the surface of the internal wall of the intestine, and also close to cells of the
proximal tubules of the kidney. In both cases, their regions are contiguous with the brush border of the cells 
concerned. 

Villin has not been detected in other types of micro-villi of various other epithelial cells when the micro-villi
were not provided with a well-organized brush border. A. Bretscher et al., Exp. Cell Res., 135:213 (1981); H.
Reggio et al., "Membranes in Growth and Development" by A. Liss, New York (1982) pp. 89-105. Furthermore, 
it has been observed that villin is present in enterocytes from an early stage of their development, well before 
the organization of the brush border. 

Villin can almost always be detected in vivo at all stages of the development of tumor cells in the digestive
region, at both early and later stages of tumorigenesis, irrespective of the state of differentiation of the cells
concerned. In particular, villin can be detected in cancers of the colon, one of the most widespread forms of
cancer. Villin can also be detected in certain types of renal cancer, but has never been detected in primary
tumor cells derived from cancers localized in the tissues of other organs, such as the liver, ovaries, and lungs.
The nucleotide sequence and the amino acid sequence of the COOH-terminal portion of human villin are
given below:



AGTAGCTGTCCTTGTAAAGCAGTACCCTACCCTGATTGTAGGGTCTCATTTTCTCA
CCGATATTAGTCCTACACCAATTGAAGTGAAATTTTGCAGATGTGCCTATGAGCAC 
AAACTTCTGTGGCAAATGCCAGTTTTGTTTAATAAATGTACCTATTCCTTCAGAAA 
GATGATAC CCCAAAAAAAAAAAA 



This nucleotide sequence and amino acid sequence identify a domain, which has been termed the headpiece
(HP).

The nucleotide sequence coding for the terminal portion of human villin can be isolated by means of the 
following procedure: 

- total mRNA (messenger RNA) is prepared according to known techniques from a cell line expressing
villin;
- a cDNA (complementary DNA) bank is constructed;
- these cDNAs are inserted into a vector capable of expressing the protein coded by the insert;
- by means of the recombinant vectors obtained, a bacterial strain is transformed, then conditions
permitting the expression of the desired protein in the bacteria are established; and
- the recombinant clones containing the cDNA specific for villin are selected by means of antibodies
recognizing villin.

These techniques are described in more detail in copending U.S. application Serial No. 287,658, filed December 
21, 1988. The sequence of the entire human villin protein has been elucidated by Arpin et al., J. Cell Biol., 
107:1759-1766 (1988) and Pringault et al., EMBO J., 5:3119-3124 (1986). 

A plasmid containing the gene encoding human villin was deposited with the Collection Nationale de
Cultures de Microorganismes (C.N.C.M.) of Institut Pasteur in Paris, France on November 13, 1985, under culture
collection deposit Accession No. I-497.

A portion of the operon encoding the human villin gene is diagrammatically shown in Fig. 1'. More
particularly, Fig. 1' depicts the structure of an isolated 5'-flanking region of the operon for the human villin gene.
Referring to the Figure, a 2 kb region is shown proximate the 5'-end. This region contains the gene promoter of
the invention. The CAP site and the ATG start codon of the gene encoding human villin are also shown in the
Figure.

It has been discovered that the operon encoding human villin contains a number of exons and introns. The
first exon is shown in Fig. 1' and is identified as "EXON 1". A number of other exons and introns are depicted
in Fig. 2, which is a partial structure of an isolated 5'-flanking region of the operon encoding human villin mRNA.
Exon 1 in Fig. 1' corresponds to the first exon comprising nucleotides 1-97 in Fig. 2. Introns are identified by
triangular regions extending toward the bottom of the Figure. It will be apparent from Fig. 2 that the operon
encoding the human villin gene contains a number of intervening sequences.

The location of the intervening sequences (IVS) in the first 900 bp at the 5'-end of human villin cDNA is shown in Fig.
3. The IVS are depicted relative to the nucleotide sequence encoding human villin. The sequence shown in
Fig. 3 is the coding sequence (exons) of the villin gene. Once again, the CAP site and ATG start codon are
shown in the Figure.

The DNA sequence of the invention encodes a promoter of the human villin gene. The DNA sequence is
shown in Figs. 4'A-B and 4"A-N. Fig. 4'A-B depicts a portion of the DNA in the 5'-region of the promoter, and
includes the CAP site and the ATG codon of the villin gene. Fig. 4"A-N depicts the entire 1991 bp DNA sequence
encoding the promoter of the villin gene of the invention. The locations of restriction endonuclease sites in the
DNA sequence are indicated in Fig. 4"A-N.

The nucleotide sequence of the promoter of the invention was derived by dideoxynucleotide sequencing.
The base sequences of the nucleotides are written in the 5' to 3' direction. Each of the letters shown is a
conventional designation for the following nucleotides:
A Adenine
G Guanine
T Thymine
C Cytosine.
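
The letter convention above, together with the complementary strands shown in Fig. 4"A-N, can be illustrated with a minimal Python sketch. The function name `reverse_complement` and the eight-base test fragment are illustrative assumptions; the sketch handles only the four letters A, G, T, and C listed above.

```python
# Watson-Crick pairing for the four letters defined above.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq: str) -> str:
    """Return the complementary strand, also written in the 5' to 3' direction."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


# Short fragment used only for illustration.
fragment = "AGTAGCTG"
print(reverse_complement(fragment))
```

Applying the function twice returns the original strand, which is a quick sanity check on any strand pair read from a figure.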

The DNA sequence encoding the promoter of the human villin gene can be isolated from its natural
environment. It is preferred that the DNA sequence encoding the promoter of the human villin gene be in a purified
form. For instance, the promoter can be free of human blood-derived proteins, human serum proteins, viral
proteins, nucleotide sequences encoding these proteins, human tissue, human tissue components, or
combinations of these substances. In addition, it is preferred that the promoter sequence be free of extraneous proteins
and lipids, and adventitious microorganisms, such as bacteria and viruses. The essentially purified and isolated
DNA sequence encoding the promoter is especially useful for preparing nucleotide probes and expression
vectors.

The promoter of the invention can also be prepared by the formation of 3' to 5' phosphate linkages
between nucleoside units using conventional chemical synthesis techniques. For example, the well-known
phosphodiester, phosphotriester, and phosphite triester techniques, as well as known modifications of these
approaches, can be employed. Deoxyribonucleotides can be prepared with automatic synthesis machines,
such as those based on the phosphoramidite approach. Oligo- and polyribonucleotides can also be obtained
with the aid of RNA ligase using conventional techniques.

This invention of course includes variants of the DNA sequence of the invention exhibiting substantially
the same properties as the promoter of the invention. By this it is meant that DNA sequences need not be
identical to the sequence disclosed herein. Variations can be attributable to single or multiple base substitutions,
deletions, or insertions or local mutations involving one or more nucleotides not substantially detracting from
the properties of the DNA sequence as a promoter for the human villin gene. It will also be understood that the
present invention is intended to encompass fragments of the DNA sequence of the invention in purified form,
where the fragments are capable of functioning as promoters for the human villin gene. For example, regulatory
elements responsible for tissue specificity of villin can be characterized by a functional test using the Firefly
luciferase gene as a reporter gene as described by de Wet et al., infra.

The DNA sequence coding for the promoter of the present invention can be amplified in the well known
polymerase chain reaction (PCR), which is useful for amplifying all or specific regions of the promoter. See, e.g.,
S. Kwok et al., J. Virol., 61:1690-1694 (1987); U.S. Patent 4,683,202; and U.S. Patent 4,683,195. More
particularly, DNA primer pairs of known sequence positioned 10-300 base pairs apart that are complementary to
the plus and minus strands of the DNA to be amplified can be prepared by well known techniques for the
synthesis of oligonucleotides. One end of each primer can be extended and modified to create restriction
endonuclease sites when the primer is annealed to the DNA. The PCR reaction mixture can contain the DNA, the DNA
primer pairs, four deoxyribonucleoside triphosphates, MgCl2, DNA polymerase, and conventional buffers. The
DNA can be amplified for a number of cycles. It is generally possible to increase the sensitivity of detection by
using a multiplicity of cycles, each cycle consisting of a short period of denaturation of the DNA at an elevated
temperature, cooling of the reaction mixture, and polymerization with the DNA polymerase. Amplified
sequences can be detected by the use of a technique termed oligomer restriction (OR). See R. K. Saiki et al.,
Bio/Technology 3:1008-1012 (1985).
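
The effect of using a multiplicity of cycles can be sketched with an idealized model in which each cycle multiplies the number of template copies by (1 + efficiency). This model, the function name `amplified_copies`, and the `efficiency` parameter are assumptions made for illustration; real reactions plateau and rarely reach an efficiency of 1.

```python
def amplified_copies(initial_copies: int, cycles: int, efficiency: float = 1.0) -> float:
    """Idealized PCR yield: each cycle multiplies the template by (1 + efficiency).

    efficiency = 1.0 means perfect doubling per cycle (an assumption, not a measured value).
    """
    if cycles < 0:
        raise ValueError("cycles must be non-negative")
    if not 0.0 <= efficiency <= 1.0:
        raise ValueError("efficiency must be between 0 and 1")
    return initial_copies * (1.0 + efficiency) ** cycles


# Ten ideal cycles starting from a single template molecule.
print(amplified_copies(1, 10))
```

Under perfect doubling, ten cycles give roughly a thousand-fold increase, which is why additional cycles raise the sensitivity of detection.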

The villin gene is strictly regulated during embryonic development of the digestive tract. Villin can be
detected in the visceral endoderm, in the primitive gut, in pancreatic and hepatic anlage, as soon as these tissues
appear. This expression is then stimulated in differentiated enterocytes, remains low in biliary cells and duct
cells, and is shut off in hepatocytes and in acinar pancreatic cells. The DNA sequence encoding the promoter of
the human villin gene is thus useful to investigate molecular genetics involved in digestive organogenesis. The
villin gene promoter strictly regulates the expression of villin in a tissue specific manner, during adult life and
during embryonic development.

2. Nucleotide Probes Containing the Gene Promoter

The DNA sequence of the invention coding for the promoter of the human villin gene can also be used as
a probe for the detection of a nucleotide sequence in a biological material, such as tissue or body fluids. The
probe can be labeled with an atom or inorganic radical, most commonly using a radionuclide, but also perhaps
with a heavy metal. Radioactive labels include 32P, 3H, 14C, or the like. Any radioactive label can be employed
that provides for an adequate signal and has sufficient half-life. Other labels include ligands that can serve
as a specific binding member to a labeled antibody, fluorescers, chemiluminescers, enzymes, antibodies which
can serve as a specific binding pair member for a labeled ligand, and the like. The choice of the label will be
governed by the effect of the label on the rate of hybridization and binding of the probe to the DNA or RNA. It
will be necessary that the label provide sufficient sensitivity to detect the amount of DNA or RNA available for
hybridization.

When the promoter sequence of the invention is used as a probe for hybridizing to a gene, the gene is
preferably affixed to a water insoluble solid, porous support, such as nitrocellulose paper. Hybridization can
be carried out using labeled polynucleotides of the invention and conventional hybridization reagents. The
particular hybridization technique is not essential to the invention.

The amount of labeled probe present in the hybridization solution will vary widely, depending upon the
nature of the label, the amount of the labeled probe which can reasonably bind to the support, and the stringency
of the hybridization. Generally, substantial excesses of the probe over stoichiometric will be employed to
enhance the rate of binding of the probe to the fixed DNA.

Various degrees of stringency of hybridization can be employed. The more severe the conditions, the
greater the complementarity that is required for hybridization between the probe and the polynucleotide for duplex
formation. Severity can be controlled by temperature, probe concentration, probe length, ionic strength, time,
and the like. Conveniently, the stringency of hybridization is varied by changing the polarity of the reactant
solution. Temperatures to be employed can be empirically determined or determined from well known formulas
developed for this purpose.
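
One widely used approximation for short oligonucleotide probes is the Wallace rule, Tm = 2(A+T) + 4(G+C) in degrees Celsius. The text does not say which formulas are intended, so the choice of this rule, the function name `wallace_tm`, and the example probe are assumptions made for illustration. The rule is a rough estimate for probes of about 14 to 20 bases and does not account for ionic strength or probe concentration.

```python
def wallace_tm(oligo: str) -> int:
    """Estimate the melting temperature (degrees C) of a short oligonucleotide.

    Wallace rule: 2 degrees per A or T, 4 degrees per G or C.
    """
    seq = oligo.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    if at + gc != len(seq):
        raise ValueError("sequence may contain only A, C, G, and T")
    return 2 * at + 4 * gc


# Hypothetical 16-base probe with equal A/T and G/C content.
print(wallace_tm("ATGCATGCATGCATGC"))
```

A hybridization temperature is then typically chosen some degrees below the estimate and refined empirically, as the text notes.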

3. Nucleotide Sequences Containing the Gene Promoter

This invention also provides the DNA sequence encoding the promoter of the invention linked to nucleic
acids, such as nucleic acids comprising a structural gene. The nucleic acid can be obtained from any source,
for example, from plasmids, from cloned DNA or RNA, or from natural DNA or RNA from any source, including
prokaryotic and eukaryotic organisms. DNA or RNA can be extracted from a biological material, such as
biological fluids or tissue, by a variety of techniques including those described by Maniatis et al., Molecular Cloning:
A Laboratory Manual, Cold Spring Harbor Laboratory, New York (1982). The nucleic acid will generally be
obtained from a bacterium, yeast, virus, or a higher organism, such as a plant or animal. The nucleic acid can
be a fraction of a more complex mixture, such as a portion of a gene contained in whole human DNA or a portion
of a nucleic acid sequence of a particular microorganism. The nucleic acid can be a fraction of a larger molecule
or the nucleic acid can constitute an entire gene or assembly of genes. The DNA can be in a single-stranded
or double-stranded form. If the fragment is in single-stranded form, it can be converted to double-stranded form
using DNA polymerase according to conventional techniques.

The DNA sequence coding for the promoter of the human villin gene can be linked to a structural gene. As
used herein, the term "structural gene" refers to a DNA sequence that encodes through its template or
messenger RNA (mRNA) a sequence of amino acids characteristic of a specific protein or polypeptide. The nucleotide
sequence of the invention can function as an expression control sequence, that is, a DNA sequence that
controls and regulates expression of the structural gene when operatively linked to the gene. In one embodiment
of this invention, the DNA sequence encoding the human villin gene promoter is operatively linked to the
structural gene for human villin. In another embodiment of the invention, the DNA sequence of the invention is
operatively linked to a structural gene other than the gene encoding human villin.

The nucleotide sequence of the invention coding for the promoter of the human villin gene can be used in
transient assay systems based on the use of fusion genes. The fusion gene can consist of the promoter of the
invention directing the synthesis of a reporter molecule. The relative amount of the reporter protein synthesized
under various conditions is presumed to reflect the ability of the promoter to direct transcription. The transient
expression system should be based on the synthesis of an easily assayed reporter protein having minimal or
no effect on the physiology of the transfected cell. Examples of suitable reporter systems are chloramphenicol
acetyltransferase (CAT) and human growth hormone (hGH). Other systems that are also useful include the
luciferase system and β-galactosidase, which have been successfully used as reporter genes in eukaryotic
cells. Thus, for example, the DNA sequence of the invention can be operatively linked to a reporter gene, such
as the gene encoding human growth hormone (hGH) or the gene coding for the enzyme luciferase. The Firefly
luciferase gene, its structure, and its expression in mammalian cells are described in de Wet et al., Mol. Cell.
Biol. 7:725-737 (1987).
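
One simple way to express the "relative amount" of reporter protein is to subtract a background reading and divide by the background-subtracted reading of a reference construct. This normalization scheme is an assumption for illustration and is not described in the text; the function name `relative_activity` and the readings below are hypothetical.

```python
def relative_activity(sample: float, background: float, reference: float) -> float:
    """Background-subtracted reporter signal as a fraction of a reference construct."""
    reference_signal = reference - background
    if reference_signal <= 0:
        raise ValueError("reference signal must exceed background")
    return (sample - background) / reference_signal


# Hypothetical readings in arbitrary light units, chosen only for illustration.
print(relative_activity(sample=550.0, background=50.0, reference=1050.0))
```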

The DNA sequence encoding the promoter of the human villin gene can also be linked to an oncogene. 
As used herein, the term "oncogene" means a gene having oncogenic or cancer causing potential. The term 
as used herein includes cellular oncogenes known as proto-oncogenes, as well as viral oncogenes. When the 
DNA sequence encoding the promoter for the human villin gene is operatively linked to an oncogene, the
resulting polynucleotide is referred to herein as an "activated oncogene sequence."

Previous studies have demonstrated that villin is a marker of digestive epithelium cell lineage. Villin gene
expression is strictly regulated during embryogenesis and terminal differentiation. Furthermore, the tissue
specific expression of villin is maintained during the carcinogenesis process. Villin can be considered as a
marker for colorectal carcinoma and can be used to characterize the colorectal origin of unknown metastases. As
a consequence of these properties, the villin gene is a good candidate to target the expression of oncogenes
in the digestive epithelia of transgenic mice. Oncogenes such as SV40 T antigen, whose ability to promote
tumors in transgenic mice has been fully demonstrated, or K-ras or p53, whose mutated forms have been found
associated with colorectal cancers in humans, can be placed under the control of the villin promoter of the
invention. This makes it possible to obtain transgenic mice developing specific tumors deriving from intestinal
epithelia.

Thus, in a preferred embodiment of the invention, the DNA sequence coding for the promoter of the human
villin gene is operatively linked to an oncogene that is associated with tumor cells of the gastrointestinal tract
or renal system of the human. Oncogenes associated with colon carcinoma are particularly preferred for use
in this invention. Examples of particularly preferred oncogenes are TAG, ras, myc, and p53 genes. It will be
understood, however, that other oncogenes can be linked to the promoter sequence of the invention. For
example, the promoter sequence can be linked to oncogenes that have appeared in retroviruses, such as the
oncogenes abl, erb-A, erb-B, ets, fes/fps, fgr, fms, fos, kit, mil/raf, mos, myb, K-ras, ret, ros, sis, ski, src, yes.
Similarly, the promoter sequence of the invention can be linked to oncogenes that have not appeared in
retroviruses, such as the oncogenes bcl-1, bcl-2, bcr, Blym, dil, int-1, int-2, L-myc, mcf2, mcf3, met, Mlvi-1, Mlvi-2,
Mlvi-3, neu, N-myc, N-ras, onc-D, pim-1, pvt-1, RMO-int-1, tel-1, Tlym-1, Tlym-2, tx-1, tx-2, tx-3, tx-4.

The protein products of many of the oncogenes are protein kinases or protein phosphorylating enzymes. 
Thus, the activated oncogene sequence can be used to study host expression mechanisms by detecting
expression of these products.

4. Vectors Containing the Gene Promoter

This invention also provides cloning and expression vectors containing the DNA sequence of the invention 
coding for the promoter of the human villin gene. 

More particularly, the DNA sequence encoding the promoter can be ligated to a vehicle for cloning the
sequence. The major steps involved in gene cloning comprise procedures for separating DNA containing the gene
of interest from prokaryotes or eukaryotes, cutting the resulting DNA fragment and the DNA from a cloning
vehicle in specific sites, mixing the two DNA fragments together, and ligating the fragments to yield a recombinant
DNA molecule. The recombinant molecule can then be transferred into a host cell, and the cells allowed to
replicate to produce identical cells containing clones of the original DNA sequence.

The vehicle employed in this invention can be any double-stranded DNA molecule capable of transporting
the promoter of the invention into a host cell and capable of replicating within the cell. More particularly, the
vehicle must contain at least one DNA sequence that can act as the origin of replication in the host cell. In
addition, the vehicle must contain two or more sites for insertion of the DNA sequence encoding the promoter of
the invention. These sites will ordinarily correspond to restriction enzyme sites at which cohesive ends can be
formed, and which are complementary to the cohesive ends on the promoter sequence to be ligated to the vehicle.
In general, this invention can be carried out with plasmid, bacteriophage, or cosmid vehicles having these
characteristics.

The nucleotide sequence of the invention can have cohesive ends compatible with any combination of sites
in the vehicle. Alternatively, the sequence can have one or more blunt ends that can be ligated to corresponding
blunt ends in the cloning sites of the vehicle. The nucleotide sequence to be ligated can be further processed,
if desired, by successive exonuclease deletion, such as with the enzyme Bal 31. In the event that the nucleotide
sequence of the invention does not contain a desired combination of cohesive ends, the sequence can be
modified by adding a linker, an adaptor, or homopolymer tailing.

It is preferred that plasmids used for cloning nucleotide sequences of the invention carry one or more genes
responsible for a useful characteristic, such as a selectable marker, displayed by the host cell. In a preferred
strategy, plasmids having genes for resistance to two different drugs are chosen. For example, insertion of the
DNA sequence into a gene for resistance to an antibiotic inactivates the gene and destroys drug resistance. The second
drug resistance gene is not affected when cells are transformed with the recombinants, and colonies containing
the gene of interest can be selected by resistance to the second drug and susceptibility to the first drug.
Preferred antibiotic markers are genes imparting chloramphenicol, ampicillin, or tetracycline resistance to the host
cell.
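
The two-drug selection logic described above can be sketched as a simple filter: a colony is a candidate recombinant when it grows on the second drug but not on the first. The `Colony` class, its field names, and the example plate results are hypothetical and serve only to illustrate the rule.

```python
from dataclasses import dataclass


@dataclass
class Colony:
    name: str
    grows_on_first_drug: bool   # drug whose resistance gene received the insert
    grows_on_second_drug: bool  # drug whose resistance gene is left intact


def is_recombinant(colony: Colony) -> bool:
    """Resistant to the second drug and susceptible to the first."""
    return colony.grows_on_second_drug and not colony.grows_on_first_drug


def select_recombinants(colonies: list[Colony]) -> list[str]:
    return [c.name for c in colonies if is_recombinant(c)]


# Hypothetical replica-plating results.
plates = [
    Colony("c1", True, True),    # plasmid without insert: both genes intact
    Colony("c2", False, True),   # candidate recombinant
    Colony("c3", False, False),  # no plasmid taken up
]
print(select_recombinants(plates))
```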

A variety of restriction enzymes can be used to cut the vehicle. The identity of the restriction enzyme will
generally depend upon the identity of the ends on the DNA sequence to be ligated and the restriction sites in
the vehicle. The restriction enzyme is matched to the restriction sites in the vehicle, which in turn is matched 
to the ends on the nucleic acid fragment being ligated. 
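
Matching an enzyme to the available sites starts with locating recognition sequences in the vehicle and in the fragment. The sketch below scans a sequence for three common recognition sites (EcoRI, BamHI, HindIII). These enzymes are not named in the text and are chosen here as assumptions for illustration, as are the function name `find_sites` and the test sequence.

```python
# Recognition sequences of three common restriction enzymes (illustrative choice).
SITES = {"EcoRI": "GAATTC", "BamHI": "GGATCC", "HindIII": "AAGCTT"}


def find_sites(seq: str, sites: dict[str, str] = SITES) -> dict[str, list[int]]:
    """Return 1-based start positions of each recognition sequence in seq."""
    seq = seq.upper()
    hits: dict[str, list[int]] = {}
    for enzyme, motif in sites.items():
        positions = []
        start = seq.find(motif)
        while start != -1:
            positions.append(start + 1)
            start = seq.find(motif, start + 1)
        hits[enzyme] = positions
    return hits


# Hypothetical sequence containing two EcoRI sites and one BamHI site.
print(find_sites("AAGAATTCTTGGATCCGAATTC"))
```

An enzyme that cuts the vehicle once at the intended insertion point, and does not cut inside the fragment, is usually the convenient choice.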

The ligation reaction can be set up using well known techniques and conventional reagents. Ligation is
carried out with a DNA ligase that catalyzes the formation of phosphodiester bonds between adjacent
5'-phosphate and free 3'-hydroxy groups in DNA duplexes. The DNA ligase can be derived from a variety of
microorganisms. The preferred DNA ligases are enzymes from E. coli and bacteriophage T4. T4 DNA ligase can
ligate DNA fragments with blunt or sticky ends, such as those generated by restriction enzyme digestion. E.
coli DNA ligase can be used to catalyze the formation of phosphodiester bonds between the termini of duplex
DNA molecules containing cohesive ends.

The vehicle to which the DNA sequence of the invention has been ligated is useful for cloning the sequence.
Cloning can be carried out in prokaryotic or eukaryotic cells. The host for replicating the cloning vehicle will of course
be one that is compatible with the vehicle and in which the vehicle can replicate. When a plasmid is employed,
the plasmid can be derived from bacteria or some other organism or the plasmid can be synthetically prepared.
The plasmid can replicate independently of the host cell chromosome or an integrative plasmid (episome) can
be employed. The plasmid can make use of the DNA replicative enzymes of the host cell in order to replicate
or the plasmid can carry genes that code for the enzymes required for plasmid replication. A number of different
plasmids can be employed in practicing this invention. Typical of the plasmids that can be utilized are pBR322,
pBR325, ColE1, and RP4. A filamentous, lysogenic phage, such as M13 phage, can be employed for cloning
the nucleotide sequences of the invention.

The DNA sequence coding for the promoter of the human villin gene and the other nucleotide sequences
of the invention, such as the activated oncogene sequence, can also be ligated to a vehicle to form an
expression vector. The vehicle employed in this case is one in which it is possible to express a gene operatively linked
to the promoter of the invention in an appropriate host cell. It is preferable to employ a vehicle known for use
in expressing genes in mammalian cells. These vehicles include SV40 vectors and eukaryotic expression
vectors derived from SV40 vectors. Constructs including the DNA sequence encoding the promoter of the invention
associated with the gene coding the large T antigen of SV40 are preferred, because the antigen gives
transforming power to permissive cells for the expression of the promoter. Other vectors include those derived from
Moloney murine sarcoma virus (MSV), herpes simplex virus, Rous sarcoma virus, the mouse mammary tumor
virus (MMTV), vaccinia virus, bovine papillomavirus, human adenovirus, and Moloney murine leukemia virus.

In one embodiment of this invention, the DNA sequence coding for the promoter of the human villin gene
was operatively linked to the firefly luciferase gene, and the resulting nucleotide sequence was ligated to
plasmid pSVOA using the techniques described in de Wet et al., supra. The plasmid pSVOA is shown in Fig. 5.

Referring to Fig. 5, the structures of the mammalian expression vector pSVOA and the full-length, intronless
luciferase gene L are shown. The expression vector pSVOA is derived from pSV2, an SV40 early-region
promoter vector. Portions of the vector derived from pBR322 are shown as a thin line; those from SV40 are
shown as a heavy line. Coordinates of the endpoints of the pBR322 segments are indicated and correspond to
the location of the restriction sites in the published sequence of pBR322. Coordinates indicated in bold
type correspond to the locations of the restriction sites in the published sequence of SV40. The vector
pSVOA contains the SV40 small-t-antigen intron and an SV40 polyadenylation signal labeled An. The vector
pSVOA contains two copies of the SV40 polyadenylation signal. The full-length, intronless luciferase
construct, designated L, is carried as HindIII-BamHI fragments in the plasmid vector pUC18. The name of the
corresponding plasmid is also indicated below the map. The coordinates of the indicated sites are taken
from the sequence shown in Fig. 1 of the de Wet et al. publication, supra. Restriction sites are indicated
with the following abbreviations: H3, HindIII; RI, EcoRI; Bs, BsmI; Xb, XbaI; H2, HincII; Sp, SspI;
Sa, SacI; Kp, KpnI; Sm, SmaI; Ba, BamHI.

The resulting plasmid derived from pSVOA containing the DNA sequence encoding the promoter of the
human villin gene operatively linked to the luciferase gene has been designated pPrVL. The plasmid pPrVL
was deposited with the Collection Nationale de Cultures de Microorganismes (C.N.C.M.) of Institut Pasteur in
Paris, France on October 26, 1990, under culture collection deposit Accession No. I-1011. The procedure used
to prepare the pPrVL construct is diagrammatically shown in Fig. 6.

Referring to Figure 6, details concerning the construction of pPrVL are given. A HindIII-SalI restriction
fragment from pSVOAL containing the luciferase (Luc) gene was ligated to a 1.7 kb SalI-HindIII fragment
obtained from a human genomic library (lambda EMBL 3). The latter fragment was obtained by screening the
library using a cocktail of 5' to 3' cDNA probes specific for the human villin gene. This construct is
referred to as plasmid pPrVU.

A second plasmid was constructed to clone a HindIII/PstI fragment from the same library containing the
CAP site of the villin gene. This plasmid was named PrV -300/+77. A HindIII-EcoRI fragment from this plasmid
was then ligated to a HindIII fragment from pPrV 1.7 and an EcoRI-HindIII adaptor. The resulting plasmid
(pPrVL) contained the 5' region of the villin gene (1.4 kb insert), the villin promoter and the luciferase
gene.

5. Cells Transformed with Vectors Containing the Gene Promoter 

The vectors of the invention can be inserted into host organisms using conventional techniques. For
example, the vectors can be inserted by transformation, transfection, electroporation, microinjection, or by
means of liposomes (lipofection). The preferred method for inserting the vectors into mammalian cells is
dependent on cell type; however, calcium phosphate precipitation of DNA has been found to be suitable.

Cloning can be carried out in prokaryotic or eukaryotic cells. The host for replicating the cloning vehicle
will of course be one that is compatible with the vehicle and in which the vehicle can replicate. Cloning is
preferably carried out in bacterial or yeast cells, although cells of fungal, animal, and plant origin can
also be employed. The preferred host cells for conducting cloning work are bacterial cells, such as E. coli,
as well as species of Streptomyces, Bacillus, and Pseudomonas. The use of E. coli cells is particularly
preferred because most cloning vehicles, such as bacterial plasmids and bacteriophages, replicate in these
cells.

In a preferred embodiment of this invention, an expression vector containing the DNA sequence encoding
the promoter of the invention operatively linked to a structural gene is inserted into a mammalian cell
using conventional techniques. For example, the vector can be transfected into a mammalian cell by the
calcium phosphate precipitation method or the DEAE-dextran method. A preferred method based on calcium
phosphate precipitation is disclosed in Chen and Okayama, Mol. Cell. Biol., 7:2745-2752 (1987).

Suitable host cells include cells that express villin and cells that do not express villin. Host cells that
are preferred for this purpose include human intestinal cell lines derived from colonic adenocarcinoma, such
as HT29 or Caco-2 cells; human epithelial cell lines, such as C33 cells; human hepatoma cell lines, such as
HepG2 cells; cell lines originating from the renal proximal tubule of a pig, such as LLC-PK1; and cells
that do not express villin, such as HeLa cells, which originate from an adenocarcinoma of the uterine
cervix. After the plasmid has been introduced into the host cell, the structural gene can be expressed using
conventional techniques.

These cell lines can be used with recombinant vectors containing the firefly luciferase gene under the
control of the DNA sequence encoding the promoter of the invention in order to isolate regions of the
promoter that can be employed for the specific expression of the villin gene in a limited number of tissues.
Cell lines have been transfected with plasmid pPrVL of the invention. This plasmid contains the firefly
luciferase reporter gene.

6. Transgenic, Non-Human Mammals 

Normal mice do not develop intestinal tumors spontaneously. It is possible, however, to induce
adenocarcinomas of the small and large intestines by using chemical agents. The histology of these tumors
resembles that of the corresponding human tumors, and they have been used to study external factors involved
in the carcinogenesis process. The usefulness of this model is, however, limited for two main reasons.
First, the model reproduces only some aspects of human carcinogenesis. For instance, metastases are not
observed, especially in the liver. Second, tumors have to be induced chemically in each mouse to be studied.
This reduces the reproducibility of the model. Moreover, tumor development is very slow (7 months on
average).

Transgenic mouse technology is well suited to developing animal models of carcinogenesis, since this
technology makes it possible to obtain mouse lines that genetically develop specific cancers in a
reproducible manner, even those that do not appear spontaneously in this animal.

Accordingly, this invention also provides transgenic animals containing the promoter sequence of the
invention. More particularly, the invention provides a transgenic non-human mammal whose germ cells and
somatic cells contain an activated oncogene sequence of the invention introduced into the animal, or an
ancestor of the animal (e.g. an F1 animal), at an embryonic stage. Preferably, the non-human mammal is a
rodent, such as a mouse. The activated oncogene sequence, when incorporated into the genome of the mammal,
increases the probability of the development of neoplasms, particularly malignant tumors, in the animal. In
a preferred embodiment of the invention, the neoplasms are analogs of human colon cancer.

There are several methods by which the oncogene sequence of the invention can be introduced into an
animal embryo to ensure that it is incorporated into the chromosome of the animal in an activated state. One
method is to transfect the embryo with the gene and to select transgenic animals in which the gene has
integrated into the chromosome at a locus that results in activation. Other methods involve modifying the
oncogene prior to introduction into the embryo. One such method is to transfect the embryo using a vector
containing the activated oncogene sequence of the invention. In a preferred embodiment of the invention, the
chromosome of the transgenic animal includes an endogenous coding sequence, preferably the p53, ras, myc, or
Tag gene, which is substantially the same as the oncogene sequence, and transcription of the oncogene
sequence is under the control of the promoter sequence of the invention.

Introduction of the activated oncogene sequence of the invention at the fertilized oocyte stage, such as by
microinjection, ensures that the oncogene sequence will be present in all of the germ cells and somatic
cells of the transgenic animal, such as the mouse. The presence of the activated oncogene sequence in the
germ cells of the transgenic animal ensures that all of the descendants of the animal will carry the
activated oncogene sequence in all of their germ cells and somatic cells. The activated oncogene sequence of
the invention can also be introduced at a later embryonic stage. This might result in the absence of the
oncogene from some somatic cells of the parent animal, but descendants of the animal that inherit the
oncogene sequence will carry the sequence in all of their germ cells and somatic cells.

The animals of the invention can be used to test a material suspected of being a carcinogen by exposing
the animal to the material and determining neoplastic growth as an indicator of carcinogenicity. The animals
can also be used to test materials for ability to confer protection against the development of neoplasms. In
either case, the animals are treated with the material, and the incidence of neoplasm development is
compared to that in untreated animals: an increased incidence is taken as an indication of carcinogenicity,
and a reduced incidence as an indication of protection. The invention also provides methods of exposing
treated and untreated animals to carcinogen prior to, after, or simultaneously with treatment with a
protective material.
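The comparison of tumor incidence between treated and untreated animals can be made quantitative. The specification does not prescribe any statistical method; the following Python sketch assumes, purely for illustration, a one-sided Fisher exact test on a 2x2 table of animals with and without neoplasms. The function name and the animal counts are hypothetical.

```python
# Illustrative sketch only: the choice of Fisher's exact test and all
# counts are assumptions, not part of the specification.
from math import comb


def fisher_one_sided(tumor_treated, clear_treated, tumor_control, clear_control):
    """P-value for observing at least `tumor_treated` tumor-bearing animals
    in the treated group if treatment had no effect (margins held fixed)."""
    n_treated = tumor_treated + clear_treated
    total_tumor = tumor_treated + tumor_control
    total = n_treated + tumor_control + clear_control
    denom = comb(total, n_treated)
    upper = min(n_treated, total_tumor)
    p_value = 0.0
    for k in range(tumor_treated, upper + 1):
        # Hypergeometric probability of exactly k tumor-bearing treated animals.
        p_value += comb(total_tumor, k) * comb(total - total_tumor, n_treated - k) / denom
    return p_value


if __name__ == "__main__":
    # Hypothetical counts: 9 of 10 treated versus 3 of 10 untreated animals
    # developed neoplasms.
    print(fisher_one_sided(9, 1, 3, 7))
```

A small p-value would support carcinogenicity of the tested material. To assess a protective material, the same function can be applied with the roles of the treated and untreated groups exchanged.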

The transgenic animal model of the invention provides an experimental approach to the different early and
late steps of carcinogenesis, and in particular to the study of oncogene involvement and synergy, the
appearance of proliferative hyperplasia, which occurs before the development of solid tumors, and the
correlation with angiogenesis.

The villin promoter of the invention is the only promoter isolated to date that is able to drive the
expression of an oncogene in the epithelium of the large intestine. Thus, the DNA sequence encoding the
promoter of the human villin gene is, to date, the only means of obtaining transgenic animals that undergo
colorectal carcinogenesis.

7. Cultures of Transgenic Animal Cells 

Intestinal cell lines derived from spontaneous colorectal carcinomas are available. These cell lines have
been used to establish cell culture models for absorptive enterocytes and goblet cells. However, the
establishment of cell lines for Paneth cells, enteroendocrine cells, and M cells has failed. Furthermore, no
well defined cell line models for duct cells of the exocrine pancreas and for biliary cells of the liver are
available.

The transgenic animals of the invention can be used as a source of cells for cell culture. Cells from the
transgenic animals advantageously exhibit desirable properties of both normal and transformed cultured cells.
That is, they are normal or nearly normal morphologically and physiologically, but can be cultured for longer,
and even indefinite, periods of time.

More particularly, tissues of the transgenic animals of the invention can be cultured in order to obtain
stable cell lines, which reproduce the differentiated phenotype of the corresponding epithelium, and which
can be analyzed for the presence of the activated oncogene, either by directly analyzing DNA or RNA, or by
assaying the tissue for protein expressed by the gene. Cells of tissues carrying the gene can be cultured
using standard tissue culture techniques. The cells are useful for studying the functioning of cells from
tissues that are normally difficult to culture. The cells can be used to study terminal differentiation
processes. More particularly, cultures of the transgenic animal cells can be used to characterize the
epithelial lines that possess specific differentiation criteria, either of the small intestine or the colon,
of ductal cells of the pancreas, or of biliary cells of the liver or of the kidney proximal tubule.

Fig. 7 is a diagrammatic representation of the preparation of an animal cell model for colorectal cancers
and cell cultures derived from animal tumors.

Referring to Figure 7, details are given concerning the protocol that is followed to obtain the animal and
cell culture model for colorectal cancers. A vector containing the villin promoter and an oncogene
(preferably p53, ras, myc or Tag) is transfected into a mouse oocyte. A selection procedure is used to
establish which mice carry the desired genes. These animals are maintained as a colony in which the genes
can be readily detected in both somatic and germ cells.

Animals carrying the activated oncogene can then be used to investigate, for example, the induction of
tumors in the digestive epithelium where villin is normally expressed. Using these animals one can develop an
animal model for colorectal cancers. Alternatively, one can establish cell lines in culture from the
digestive epithelium of transgenic animals carrying the villin gene and promoter, which might be coupled to
ras or myc, to permit the in vitro study of colorectal tumor cells.



Claims 

1/ An isolated DNA sequence corresponding to a promoter of the human villin gene, wherein the nucleotide 
sequence consists essentially of:

2/ A DNA sequence comprising the sequence as claimed in claim 1 operatively linked to a nucleotide
sequence of an oncogene.

3/ The DNA sequence as claimed in claim 1 or 2, wherein the oncogene is selected from the group
consisting of p53, ras, myc and Tag.

4/ An isolated DNA sequence as claimed in claim 1 operatively linked to a structural gene.

5/ An isolated DNA sequence comprising the sequence as claimed in claim 1 or 4 operatively linked to a
gene encoding villin.

6/ A DNA sequence comprising the sequence as claimed in claim 1 operatively linked to a gene encoding 
luciferase. 

7/ An isolated RNA sequence complementary to the nucleotide sequence according to any one of claims
1 to 6.

8/ A vehicle comprising a plasmid or viral vector containing the nucleotide sequence as claimed
in any one of claims 1 to 6.

9/ The vehicle as claimed in claim 8, wherein the viral vector is an SV40 vector.

10/ The vehicle as claimed in claim 8, wherein the plasmid is pSVOAL.

11/ The vehicle as claimed in claim 8 having the identifying characteristics of the vector having culture
collection accession number CNCM (Collection Nationale de Cultures de Microorganismes, Paris, France) I-1011.

12/ The vehicle as claimed in claim 8, wherein the vector is an expression vector. 

13/ A transgenic, non-human mammal all of whose germ cells and somatic cells contain a recombinant
activated oncogene sequence according to any one of claims 2 to 5, which has been introduced into the
mammal, or an ancestor of the mammal, at an embryonic stage.

14/ The mammal as claimed in claim 13, wherein a chromosome of the mammal includes an endogenous
coding sequence substantially the same as a coding sequence of said oncogene sequence.

15/ The mammal as claimed in claim 14, wherein the oncogene sequence has been integrated into a
chromosome of the mammal at a site different from the location of the endogenous coding sequence.

16/ The mammal as claimed in any one of claims 13 to 15, wherein the mammal is a rodent.

17/ The mammal as claimed in any one of claims 13 to 15, wherein the mammal is a mouse.
                         Met
CCTCTCCCCCAGGCTCACTC ACC ATG ACC AAG CTG AGC GCC CAA GTC AAA GGC TCT CTC
AAC ATC ACC ACC CCG GGG CTG CAG ATA TGG AGG ATC GAG(+97) -- IVS 1
(390 bp) -- GCC ATG CAG ATG GTG CCT GTT CCT TCC AGC ACC TTT GGA AGC
TTC TTC GAT GGT GAC TGC TAC ATC ATC CTG GCT(+172) -- IVS 2 (1300
bp) -- ATC CAC AAG ACA GCC AGC AGC CTG TCC TAT GAC ATC CAC TAC TGG
ATT GGC CAG GAC TCA TCC CTG GAT GAG CAG GGG GCA GCT GCC ATC TAC ACC ACA
CAG ATG GAT GAC TTC CTG AAG GGC CGG GCT GTG CAG CAC CGC GAG GTC CAG GGC
AAC GAG AGC GAG GCC TTC CGA GGC TAC TTC AAG CAA GGC CTT GTG(+370) --
IVS 3 (>2000 bp) -- ATC CGG AAA GGG GGC GTG GCT TCT GGC ATG AAG
CAC GTG GAG ACC AAC TCC TAT GAC GTC CAG AGG CTG CTG CAT GTC AAG GGC AAG
AGG AAC GTG GTA GCT GGA G(+476) -- IVS 4 (450 bp) -- AG GTA GAG
ATG TCC TGG AAG AGT TTC AAC CGA GGG GAT GTT TTC CTC CTG GAC CTT GGG AAG
CTT ATC ATC CAG TGG AAT GGA CCG GAA AGC ACC CGT ATG GAG AGA CTC AGG
G(+590) -- IVS 5 (1000 bp) -- GC ATG ACT CTG GCC AAG GAG ATC
CGA GAC CAG GAG CGG GGA GGG CGC ACC TAT GTA GGC GTG GTG GAC GGA GAG AAT
GAA TTG GCA TCC CCG AAG CTG ATG GAG GTG ATG AAC CAC GTG CTG GGC AAG CGC
AGG GAG CTG AAG GCG GCC GTG CCC GAC ACG GTG GTG GAG CCG GCA CTC AAG GCT
GCA CTC AAA CTG TAC CA(+792) -- IVS 6 -- T GTG TCT GAC TCC GAG GGG
AAT CTG GTG GTG AGG GAA GTC GCC ACA CGG CCA CTG ACA CAG GAC CTG CTC AGT
CAC GAG GA(+873) -- IVS 7 -- C TGT TAC ATC

FIG. 3
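The exon boundary coordinates in Fig. 3 can be cross-checked by counting nucleotides between consecutive intron (IVS) markers. The Python sketch below is an illustrative check only; it assumes that the boundary labels before and after the first exon segment read +97 and +172, and that those around the last complete segment read +792 and +873. The helper names are hypothetical, and the two sequence strings are copied from the figure.

```python
# Illustrative consistency check; helper names are hypothetical and the
# boundary labels are assumed to read as stated in the lead-in.
def segment_length(text: str) -> int:
    """Number of nucleotides in a figure segment, ignoring spaces and line breaks."""
    return len("".join(text.split()))


def boundary_after(start: int, segment: str) -> int:
    """Coordinate of the last nucleotide of a segment that follows position `start`."""
    return start + segment_length(segment)


# Exon sequence between IVS 1 and IVS 2, as printed in Fig. 3.
EXON_AFTER_IVS1 = """
GCC ATG CAG ATG GTG CCT GTT CCT TCC AGC ACC TTT GGA AGC
TTC TTC GAT GGT GAC TGC TAC ATC ATC CTG GCT
"""

# Exon sequence between IVS 6 and IVS 7, as printed in Fig. 3.
EXON_AFTER_IVS6 = """
T GTG TCT GAC TCC GAG GGG
AAT CTG GTG GTG AGG GAA GTC GCC ACA CGG CCA CTG ACA CAG GAC CTG CTC AGT
CAC GAG GA
"""

if __name__ == "__main__":
    print(boundary_after(97, EXON_AFTER_IVS1))   # expected to match the label before IVS 2
    print(boundary_after(792, EXON_AFTER_IVS6))  # expected to match the label before IVS 7
```

The same count applied to the intermediate segments gives lengths of 198, 106, 114 and 202 nucleotides, which agrees with boundary labels of +370, +476, +590 and +792.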
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GTCGA CCTGCA GGTCAACQG ATCATCGATTTAGTTAACCC CCTAAGTTGACCCCCTCAQC - 1 8 5 0 
T CAT AGAT G<\AGAAGGGAAG TCTACGTTCATACAQCTAGT GAGTGTCAAAGAAAAAGAGG - 17 9 0 
GACGGGGCCCGGCGCGGTGG CT CACQCCT GT A ATT CCAQC AATTTAGGAGGCTGAGGCGG - 17 3 0 
GTGGATCACCTGAGGTCAGG AGTTCGAGACCAGCCTGGCC A AC AT GTGGT GGAA AT CCCG - 1 6 70 
TCTCTATTAAAAATACAAAA TTAGCTGGGCTTGAGGCTCG CGCCTGTAATCCAGCTACTT - 1 6 1 0 
CAGGGAGGCTGAGGCAGGAG AATCACTTGAACCCGGGAGC CGAGGTTGAAGTAAGCCGAG -15 5 0 
ATTGTGCCATTGTACTTCAG CCTGGGCAACAAGAGTGAAA CT CT GT CCC AA AA AT A A AT A - 1 4 9 0 
AATAAATAAATAAATAAAAG AAGGGGACAGAGACAAAGGT GAT GAC AGA A AT GAA AC AGA - 1 43 0 
AAACGGTCTCCTACTTTATA CCCCAACCTGGTTCTGCTGG ACCAGGAATCCTGGGAGCCC - 1 370 
TGGGGACTTGGTAGGTTGAA AAATACCTGATGGAGGAGAC CATGGTTXXXXCCTTGATGG - 1 3 1 0 
AGAAGGCCAT AGCCAACAGG CC AT CAGACAGGACCCAAGC CCCAAGGGGCTCCTACAGTT - 1 2 50 
A C AC QGC AAGCT OCT GCT A A TCAGTCGG^GCAGGCAATGA TCAGAGACTGGGCAGGGCTT - 1 1 90 
ACCTCCGAACTCTAGAACGC CAGGGAAGCAGCAGCCACCC CTCAGGCTCAGAAAAGGGAG - 11 30 
TTTCTAAGTACTAAAAGGCC A AGT CACT A AGAGGT GACT C AAA GT C A AGGT GC A A GGAGA - 1 070 
GATTAGATAGTGAGATTTCA GGCT GT C ACGGAGT GG^GGC CTGACCTGTCGGCACATGCC - 1 0 1 0 
CTCTCATTCATTATTCATCA GTT C A AGGAGCATT CAT GGA GGGCAGCTCTCTCGAGGCTG -950 
GGTGATAGGCTCCATGGAGA T CGGGGTT GGCCT CT CT GGAT GAGGAGGAACCTCAAAACTA - 89 0 
GGGAGGAGAATAACTGGGCA CT A GT GGCT GT CT GC CT A GT T CAT AT C AGAAT CACTGGGG - 83 0 
AGTTTTAAAAGCACCCCATG CCT C ACCTCCCAAAAGGCAG AAAATAAAGCACCCATGCCC -770 
AAGCCCCACCCT GGCCT AAT TAAATCAGAATACCAGAGCC T GGAGCAT C AGC ACGCT XCC - 7 1 0 
GAAGCT CCCCAGCTGAGGCT GATGAGCAGCAGGAGCGAAT ACAGCTACAGGTCCAAGCAA - 650 
AAAGGGCCGGCCAGACACGT GGCT CATGCCTT GTT AATCC CAGCACTTTGGGAGGCTGTG - 5 90 
ACAGGT GGAT CACTT GAGGT CAGGAGTTTGAGACCAGCCT GGCCCACAT GGGGA A ACCCT - 53 0 
GTCTCTACTAAAAATACAAA ATT AGCCAGCGT GGCT GT C A CGCCTGTAATCCCAGCTATG - 4 70 
CGGGAGGTT GAGGT A GGAGA AT C ACT TGA ACCT GGGAGGC AGAGGTTGCAGTGAGCCAAG - 4 1 0 

FIG. 4'A 
ATCACGACACTGCACTCCAG CCTGTGCGACAGAGCGGGAC TCCATCTCAAAAAGAAGAAG -350
AAGAAAAGAATAATATGGAG AAAGGGCTGACATGAGGAGA AAAGCTTGGGGAGGGGGGTT -290
CCTTCCAGTTTAGCGGGCAG AGAGTAGTTCACAGTGGGAG GGTTCACTCTGTTCATTCAT -230
TCATCCATCTCTCACTCCTT CCCTCCCCCTCTCTTCCTTA TTGAGCAAAGTCACTGTGCT -170
AGACACTGCGGATTTGGTAC TAAACAAGCGGAAATGGTCC CTGAGTGGGGAGTAACCCTC -110
TGTGCTCCCCCAGGCCCCTG AGCCTGCGACCTCCTGGCAC TGTTGGGAGTTCTCTAGGGG -50

+1 (CAP site)

CAGGCTGAAGGGCAGCATCC GAACTCAGGTGCCCCTGCTC CCAACGCCT TSTCCCCCA
+21

GGCTCACTCACC ATG ACC AAG CTG AGC GCC CAA GTC AAA GGC TCT CTC AAC AT

+80

C ACC ACC CCG GGG CTG CAG GTCGAC



FIG. 4'B 






EP 0 496 174 A1 



Ln 

















D 




— — o 


o — 




C/) 








Z 




— > 




O H- 


0> 


H 






> 


< 




O 


0 3 




0> 










o 


o 


n 




X) 








CO 


— H 


> — 








w 




(A 




CD 


o 






»-« 


w 




(D 


50 


— <D 


o — 










*o 


X) 






H 








2C 


w 


C 


a 


\o — O 


o — 


CD 








rr 


(D 


cn 


• O 


o 








h-» 




3 


50 


H 


> 










M 


O 




O 




2 










0) 


1 



u> 



ha ls> fsj io 
O O O 



Ln 



Ui u u 



> 



00 



O O 

> H 

CD O 

35 

o o 

o cd 

o o 

h > 

> H 

a o 

H > 

> H 

O O 

o o 

53 

> H 
H > 
O CD 

> H 

> H 
H > 
H > 
O O 

o o 
a o 
o o 
o n 



cn — 
Cn i& — 
Ln 



CD 
O 



3 
O 



— > w 



> 

o 

> 

»-3 



H 
H 



H 
O 



H 
H 
O 



□ 20) 
•O E? 0» 
DOC 
U> 

O H H > 

H 



x be 

d a» to 
o co 



o 
a 

(D 



— h X 
I— 
3 
O 



o o 

cd o 

cn o 

CD o 

cd o 

> H 

O O 

H > 

n a 

cd o 



3 

3 O 

»- a 



o a 

H > 

> H 

O O 



vo 



O 
H 
O 
O 

> 

o 
o 

o 
o 
> 

cd 



o 

H 

<d 
o 
> 

CD 
CD 
H 

o 

CD 
> 

n 



3 
(D 
Of 
M 



P0 
(D 
U) 

rr 

M 

O 
rr 
p" 
0 

.3 

P> 



> 
> 



2 

o 



> 

CD 



n o 
n o 



> 
> 

£D 



CO 

o 



> 

o 



46 



Ef> 0 496 174 A1 



V0 

on 



H H N) KJ N) 
fO N) 00 OD 0D 



H H W NJ NJ NJ N) i 

ro k> in cn ut cn cn 

ON ON 



H W U 

u> ro n> 



TJ 

Q 

09 



> H 




a o 




> H 




— H > — 




O O 


bi 


o o 


(D 


> H 




> H 


h- 1 


O O 




• H > 




> H 




H > 




O O 




— H > 


> 


o o 




— o o — 


c 2 


> H 




H > 




o o 








o o 








O (D 




> »-5 




O O 




•to t-J 




o o 




H > 




o o 




















o O 








o o 




• l I \i * 




O <T) 




*-i ^ 




O O 






m ^ 
i*j 


r\ r\ 


\ i " 


— r\ — 


O Q> 


u u 


\J 


— O O — 




— r\ r\ — 


w ^» 


• Ui l J 


SO 


t J W / 










•J 


l J y J 


w 






\ 1 y / 




l J b J 




J* M 




1 J y ) 




• t J U J 




O O 








o cn 




H > 




o n 




o o 




o o 




o o 












> ^ 




H > 




H > 




> H 




> H 




a o 




o o 




H > 




o a 




♦ on 





> cu w z: 

0 0) W H 
1> D 13 B» 



►H 0D 
ON 

X 

K CD H 
►a rr 

c 



c/> 

Di a: en 

C ft) CD 

*£> (D C CD 

ON VO 0 

ON D 



in 

^ 0> *D 



ON 

o 



47 



EP 0 496 174 A1 



33 

• 

O 



> H 

o n 

o o 

> H 



> 

H 
H 
H 
H 
H 
> 



(D O 
H > 



ho 



H 
H 
H 
> 
> 
H 

n 



> 
> 
> 

> 

CD 



o o 

> H 

n o 

o o 



O 
CD 
> 
> 



o 
o 

H 



00 

o 



o 
en 



2 
w 
(D 



00 



> 

C 



CO 
ON 



CO 



n o 

H > 

n o 
o o 
o o 

> H 

a o 
o o 
o o 
o o 
o o 
o o 

> H 

n o 

> H 
H > 
H > 

> H 

CD O 

<d o 

o o 
o o 

> -3 
H > 

cd o 

5d 

o o 

H > 

O O 

o o 
o o 

H > 

O <D 

O O 

CD O 

> H 
— O CD — 



2 



CP 

X w x 
7 rr k 
& CD 
T5 



H-» h-» CO 

CO CO CJ 



o 



> 

c 



2 

3 



M O 

a 





H 


> 






n 


o 






n 


o 






o 


o 






H 


> 






n 








o 


a 






H 


> 






— O 


o — 


X 


NJ 






H- 


O 






3 








(-tt 






N> 








O 


M 



2 
3 



H > 

H > 

> H 

53 

H > 

O CD 

o a 

o o 

O CD 

o n 

> H 

o o 

o cd 

O CD 

O O 

n cd 

O CD 

o o 

> H 

o <d 

o o 

H > 

> H 

o o 

H > 

o o 

o n 



> 

o 

H 
O 
O 
> 



O 



NJ fO fO — 
»-* k-» H* 

M NJ ro ^ A — 
N) H H H — 

»-* a\ a\ a\ 









NJ — 






















U> 





H 
CD 
> 
CD 

o 



H 
H 
> 



2 
3 



a 

(D 2 
3 



> 

M O 2 <fl 
* or 0* 

3 O C X 
HH>y 

03 

O M M U5 

a c 

CD 2 cu 
3 a\ 



O O 

H > 

o a 

n cd 

H > 

O CD 

> H 

> H 
O O 

n o 

H > 

n cd 

o o 

o n 

H > 

o id 

o n 

(D O 

> H 
O (D 

o a 

o o 

CD O 

H > 

H > 

CD O 

H > 

> H 
O CD 

> H 

n cd 

O CD 

> H 
O CD 

n o 

H > 



Of 



— 03 


PI CO 












(0 


o o 












— rt 


0 n 




tn 


X 






— z 




a» 




a* 


X 










(t> 








— h-l 






M 




CD 

H* 


Pfl 


— > 


2 










2 Z 


— r-h 


Z to 










(A 


t— 












X 


M 


tn 












H 


1-4 














HH ^ 
M 

M 













> 

H 

n 



cd n 
o n 
o o 



48 



EP 0 496 174 A1 



U U) U OJ u u u 

U) U U U) u 



.fife 
to o 

NO 



33 

t 

a 

o 



UJ u u u 
00 00 00 00 



H 

> 

o 

H 

o 
> 
> 
n 

H 
H 

o 
o 

(D 

n 
o 
n 

H 

n 

• o 

• o 

H 

n 
o 
> 

o 

H 
H 

n 
> 

H 
H 

o 
o 
o 
n 

H 

► o 

> 
> 

o 

o 
o 

► > 

- o 
> 

o 
> 
> 

o 

H 

► n 

- o 

> 

o 
o 
o 
o 

H 
H 

• O 
H 
H 

n 

o 
> 

o 

H 
H 

• H 



> 
> 
H 
O 
> 

o 

H 
O 

O - 
o - 
n 
o 

o - 
o 
> 
o 

o - 
o 

o - 
> 
o 
o 

H 
H 
O 
> 
> 

o 

H 

O 

o 
o 
> 

> 

H 
H 

<r> 

H 

o 
o 
> 

H 
H 
O ■ 
H 
> 

n 

H 
H 

n 
> 

<n 
n - 
n 

H 

o 

o 
n 
> 
> 

o 

> 

o 

H 
O 
> 
> 
> 



> CD Z t/) £/)(/) X 
<.OOO(033OJXSZ(/) 

I *— I hH H H H 25 



CO 

O 2 
3 



< 



ro 

00 V£> 

o 



50 

(0 



oo ro w w 

(A O O ft 

rr o n a 
Z 33 ^ 



o 



o 
o 



49 



EP 0 496 174 A1 



H 
H 
H 



cn cn cn — 
o o o 
-j 



31 

■ 

m 



en cn cn 
ro NJ 
K> ISJ Ni 



SO SO 



cn 



cn 
o 



cn cn 

CJ CJ 

cn cn 



cn 



cn 



so 



cn 
cj 
vo 



O O 

o <n 

n o 

> H 
Q O 

> H 

o n 
o o 

> H 
H > 
O O 

> H 

5 3 

H > 

> H 
H > 
O O 

o o 
o o 
o n 
h > 

o o ■ 
o o 

> H 

• o o 

o o 

> H 

> H 
<T> O 

> H 

o o 
o o 

> H 
cn cn — O CD 
H» m • O O 

H > 
O O 

o o 

H > 

O O 

o o 

H 
H 
> 

a 
o 
> 

o o - 
o o ■ 

H > 

O O 
O O - 

o o ■ 
o o 

> H 

o o 

o o 

o o 

r> o 

H > 

o n 

> H 

> H 

O O 

n o 

> H 
H > 
O O 
O O 

> H 

> H 
O O 
H > 
H > 



cd m co 
won 

rr O n 
Z 50 ^ 



< 



cn cn cn 

CJ CJ UJ 

o o o 



5 

H 

n 
o 

H 



cn 
&» 
c 

VO CD W (/> 

a\ w o o 

rr o n 

W Z ?3 ^ 

K 

H H H P* 



03P1W 
WOO 

o H 
50 



rr 
Z 



CO 

o 

Z M 



M Di 



cn — 
cn cn cj — 

CJ CJ 00 • 

vo vo 



fl> 

o 



CD W < 

WO O 

rr o «-t 

Z 50 ^ 



03 03 
fb w 
3 TJ 

C/J H M C/l 

H 00 (D 

<Ti O 



cn 

GO 



> 

> 

> 
a 
o 
o 

H 
H 
H 
H 
H 
> 
H 
H 
H 
> 
H 
H 
H 
> 
H 
H 
H 
> 

H 
> 

H 
H 
> 
H 
H 
H 
H 
O 
H 
H 

n 
o 
o 
o 

H 

o 

H 

► o 

H 
O 
H 
CI 
H 
H 
H 
- O 

o 
» > 

o 

> 

O 
H 

H 

o 

H 

• H 

H 
> 
O 
H 
H 
H 
<D 
H 
O 

• H 



O 
H 
O 
H 

CD 

O 
O 
O 
> 
> 
> 

> 

> 
> 
> 

> 

> 

> 

> 
> 
> 

> 
> 
> 
> 

> 

O 
(D 
O 
O 
> 

o 
> 

> 

o 

> 
> 

o 

O 
> 
H 

> 

> 
> 
> 

H 
O 
> 

> 



cn 

O 



03 
O 



50 



EP 0 496 174 A1 



\D SO SO 

cn cn cn 







H > 








O (D 








O CD — 03 CD 






OO-Q) W Z 






O CD 3 >0 h- 






cd n 


H-* CD 














o o » 


H CD H 






o o 


ON < 




• 


> H 








H > 


M 






O O 








H > 




0> 


— 


o o — : 


z 


CH 




> H 






> H 0 






H > 








O O » 


-1 






H > i 


— t 




« 


<D O • 


-» 






n o 








n <d 








o o 








H > 




ON 


— 


H > — 


> 


ON 


— 


O (D — H 03 ^ 


cn 




o o 


c cr 3 








< c: 




— 


o o — 






• 


o o 








> H 








o cd 


I— » 






o o 








> H 








H > 








H > 








> H 








CD O 








H > 






• 


O CD 








> H 








O O 








o o 








o o 








•H > 








n cd 








o n 








H > 








O O 






• 


O CD 








a o 








h > 








H > 




ON 


— 


> H — 


DO 




— 


O CD — 


DOS 






H > 


h-T3 cr 






> »-3 


3 O 







CD O — 








H > 






• 


n o 








H > 








O CD 








H > 








<D O 








> H 








O O 








n o 








o cd 








CD o 






• 


H > 








O O 








O O 








o o 








o o 








> H 








> «H 








H > 




-J 




cd n — 




h- » 




o o 


D 


GO 


• 


> »H 





(7% 



CO 

-J 



cn cn cn 
Cn CD CD CD 

co o O O 



cn 

00 



CD 

a* 
< 



c 



(Ti O 

o o 



CO 

c 

CO > 

> 



cn 



a\ a\ n> fo 

N> N> (VJ 0T\ OA 
—J *«4 -J 



H > 

H > 

H > 

> H 
H > 
O D 

o n 

> H 
O CD 
H > 

> H 
O CD 
O O 
H > 

o o 

O CD 

H > 

O CD 

H > 

o o 
o o 

H > 

> H 
O CD 
O CD 

> H 

> H 
O O 
(D O 

> H 

> H 
O <D 
H > 

> H 
O O 

n o 

H > 

n <d 

H > 

o o 

O CD 
CD O 
(D O 
H > 

> H 
H > 

n o 

n o 

o n 

h > 

Hi > 

<D O 

H > 

O (D 

O (D 

CD D 

(D O 

H > 

> H 

(D n 
o o 

H > 

o n 

H > 

O CD 

n o 

H > 



CD 
<D 



U> CJ 
CD 00 



o 
o 



CD O 
H > 



n 



> 

CD 



q n 

(D O 

O O 

o n 

»-3 > 



o 
o 



W CO 
ID ft Z 
0 < M 
0» 



M h- 



X 

CD 



CD 



0 

o 
o 

o 



0 > 5S W 
C < M 0» 
3C D» B» C 
vO 

HH H C\ 
M < 



cn co 
a* rr 
o ^< 



O 



o 



51 



EP 0 496 174 A1 



O O 

H > 

CD — C) O - 

O > H 

U> H > 

. H > 

00 — O Q - 

O H > 

qo -J — O O - 

o oo - O O - 

VO h* > H 

CD GO O — O O - 

»-* H > 

po n> on 

> H 
O O 
H > 
H > 
H > 

• no 

o n 
h > 

H > 

o o 
n o 

> h 
n o 
o n 

• h > 

H > 

o o 
n o 
h > 
o o 

T| no 

O > H 

f l/ • > H 

no 

H > 

• n o 
h > 

> H 

o n 

H > 

n o 
n o 
o n 

• > H 

CD — O O 
ON > H 

go h — o n ■ 

on H > 

cj on 
n o 
n o 
>-j > 
n o 

n o 

00 - O O 
GO 00 -J — H > 

co >j m — no 
>j u u no 
^ on 
o n 

> H 

n o 



a 



3 
3 



CO 



I— 
3 



U Ui U U) 
ON ON OA OA 



2 

CD 



— o n 

en cn — on- 
<-n on on > H 

-j on 

n o 
n o 
-j —on 

ON > H 

cj on 
n o 

H > 

H > 

• H > 

H > 

n o 
n o 
n o 

H > 

n o 

> h 

> H 

> H 

— o n 



o n 






o n 






n o 












o n 






> H — 


X 




o o — 


IT 2 




> H 


A) Q> 




H > 


(0 




n o 


H 




H > 


*-» 




H > 






o n 






n o 






on — 


CD C*3 CO CO 




o n 


W O O (0 




H > 


ft O H O 




n o 


52 30 *q 




n o 


, M 




n o 


W 1-4 l-t 




35 






no — 


dd *n 




o n 


cr 3 




H > 


< c 




no — 


03 X* 




o n 


m cr x n 




H > 


< c 


ON 


n o 




ON 


o n 


M X XO 


o n 






H > 






o n 






o n 







CD 2 
(0 3 O 

c h- a 

ON »-i 



a 
a 

fD 



CO 
O 



-J CO 
CO u> 



2 

3 



x k/) 

0i rr 
(D C 

1-4 H 



> H 
H > 

— H > ■ 

— no 

> H 
H > 

o n 

> H 
H > 

• H > 

-J — H > 

vo — n o 
so n> no 
u> on 
o n 

35 

-J — o o 

\Q • > H 

vO 



a 
a 

(0 

in 

HOT) 
b» (A 
OJ 

M 



X 

ft* X 
(D 0i 
(0 



ON 
00 



2 ►-». 

fit 

(D 



CO 
CO 

o 



00 

o 
o 



52 



EP 0 496 174 A1 
























[FIG. 5: map drawing not reproduced. Legible labels: amp r, EcoRI (4361/1782), BamHI linker, (2065), BclI (2770), BamHI (2533), HindIII, linker, AUG, UAA, (A)n. Legible map positions: -17, 52, 639, 1354, 1702, 1810; 30, 99; 1747, 1841.]

FIG. 5






[FIG. 7: flow diagram, drawing not reproduced. Recoverable content, in reading order:]

VILLIN promoter + oncogene (Tag, ras, myc, p53)
-> OOCYTE
-> TRANSGENIC MICE
-> INDUCTION OF TUMORS IN DIGESTIVE EPITHELIA WHICH EXPRESS VILLIN
-> CELL CULTURE: NEW CELL LINES DERIVED FROM MOUSE DIGESTIVE EPITHELIA
-> DEVELOPMENT: AN ANIMAL MODEL FOR COLORECTAL CANCERS

FIG. 7






European Patent Office

EUROPEAN SEARCH REPORT

Application Number: EP 91 40 2887

DOCUMENTS CONSIDERED TO BE RELEVANT

Columns: Category; Citation of document with indication, where appropriate, of relevant passages; Relevant to claim; Classification of the application (Int. Cl.5)

No relevant documents disclosed

CLASSIFICATION OF THE APPLICATION (Int. Cl.5)
C12N15/00
C12N15/85
C12N15/12
A01K67/027

TECHNICAL FIELDS SEARCHED (Int. Cl.5)
C12N
C07K

The present search report has been drawn up for all claims

Place of search: THE HAGUE
Date of completion of the search: 24 FEBRUARY 1992
Examiner: CHAMBONNET F.J.



CATEGORY OF CITED DOCUMENTS

X : particularly relevant if taken alone
Y : particularly relevant if combined with another document of the same category
A : technological background
O : non-written disclosure
P : intermediate document

T : theory or principle underlying the invention
E : earlier patent document, but published on, or after the filing date
D : document cited in the application
L : document cited for other reasons

& : member of the same patent family, corresponding document






